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Abstract 

A 40-Gb/s CDR circuit incorporates a quarter-rate phase detector and a multiphase LC 
oscUlator to retime the data and demultiplex it into four 10-Gb/s outpats. Fabricated Id 0,18- 
fim CMOS technology, the circuit prtidnees a clocic jitter of 0.9 ps^rms and 9-67 p8,pp with a 
PRBS of 2^^ - 1 while consuming 144 mW. 
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Clock and data recovery (CDR) circuits operating at tens of gigabils per second pose difBcult 
challenges with rcspcci to speed, jitter^ signal distribution, and power consumption. Half-rate 
40-Gb/s CDR circuits have been implemented in bipolar technology [1,2], but they require S-V 
supplies and draw 1.6 to 5 watts of power. (The work in [I] uses an external oscillator and 90** 
phase shifter.) On Oie other hand, the recent integration of 10-Gb/s receivers in CMOS technology 
[3] encourages further research on CMOS solutions for higher speeds, especially if it enables low- 
voltage, low-power realization. This paper presents die design and experimental verificaiion of a 
40-Gb/s phase-locked CDR circuit fabricated in 0- 1 ^-pm CMOS technology. 



A. CDR Architecture 

Shown in Fig. . 1, the circuii incorporates a multiphase voltage-controlled oscillalor CVCO), a 
quarter-rate phase detector (Pp), a voltage-to^urrent (V/I) converter, and a simple loop filter. The 
PD uses the lialf-quadrature phases provided by the VCO to sample the input data every 12.5 p's, 
thereby detecting data edges and determining whether the clock is early or late. Pour of these 
samples fall in the center of the data eye, retiming and demultiplexing the.40-Gb/s input into four 
lO-Gb/s outputs. In the absence of data transitions, the V/I converter generates no output current, 
leaving the oscillator contrxDl line undisturbed. The circuit is fully differential, except for the oscil- 
lator control line. With quarter-rate sampling, the fiipflops* hold time can be four times as long as 
that required in fUIUrate operation, but their acquisition speed must still guarantee correct sampling 
of the input bits in less than 25 ps. 
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Fig. I. CDR architecture. 
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jS. Building Blocks 

VCO The speed, jitter, and driving capability required of the oscillator point to the use of an LC 
realizacion. A number of multiphase LC oscillators have been reported. Coupled oscillators [4, 5] 
operate away from the resonance frequency of the tanks so as to create the required phase shift* 
thus bearing a trade-oif between reHability of osciUation and the phase noise [5] . Fmtherroorc, such 
topologies are proiic to oscillation at more than one frequency because they can satisfy gain and 
phase requirements at multiple frequencies. The multiphase oscUlator in [5] drives transmission 
lines by a gain stage loaded by resistors, incurring energy los? in each cycle. 

The muhipliase oscillator introduced here is based on the concept of differential stimulus of a 
closed'-loop transmission line at equally-spaced points. Xllustraied in Fig. 2, the circuit sustains a 
phase separ^iUon of 180* at diagonally-opposite nodes, providing 45' phase steps in between, Un- 
Ulce the topologies in [5] and p], this circuit does not operate away from the r^onance frequency. 
Moreover, its oscillation frequency is uniquely given by the travel time of the wave around the 
loop. Also, in contrast to the design in [6]» the transmission line requires no termination resistors, 
thereby displaying lower phase noise and larger voltage swings for a given power dissipation and 
inductor Q. 




(a) a» 



Fig. 2, (a) Proposed half-quadrature oscillator, (b) modification of (a), (c) realization of -Gm ceW. 

The topology of Fig. 2(a) nonetheless necessitates long interconnects between the nodes and 
their corresponding -Gm cells. However, recognising that diagonally-opposite inductors cany 
currents fliat are 180** out of phase, wc modify the circuit as shown in Fig. 2(b), grouping inductor 
elements into differential struaures and placing the -Gm cells in close proximity of the oscUlaior 
nodes. Exploiting the higher Q of diflereniial inductors [8]. the VCO incorporates the -Gm ccU 
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shown in Fig. 2(c), shaping the rising and fiiUing edges by vhe PMOS transistors and hence low- 
oring the upconversion of l// noise [9]. SpcctreRF simulations indicate that, for a given power 
dissipation, inductor and frequency of oscillation, die proposed oscillator achieves twice the 
voltage swings and 12 dB lower phase noise than that in [6]. 

Each differential port of the VCO is buffered by an inductively-loaded differential pair. These 
buiTers (1) isolate the VCO from the long interconnects going to the PDs that would orhciwise 
introduce greater uncertainly in the oscillation frequency; (2) generate voltage swings above the 
supply voltage, thus driving the flipflops tifficiently; (3) isolate the VCO from the data edges cou- 
pled tiirough the phase detectors. 

Phase Detector The^ PD employs eight flipflops lo strobe the data at ! 2.5-ps intervals [Fig. 3(a)1. 
In a manner similar to an Alexander topology [10], the PD compares every two consecutive sam- 
ples by means of an XOR date, generating a net output current if 'die two arc unequal, i.e., if an 
edge has occurred. With no data transitions, the FFs produce equal outputs, and the V/I converters 
a zero current. 




Fig. 3. (a) Quarter-rate phase de^^ctor, (b) its charactbristic. 

The early-lace phase detection method used here exhibits a bang-bang characterisdc, forcing 
the CDR to align every other edge of the clock with the zero crossings of data under tlie locked 
condition. In reality, the melastable behavior of the flipflops leads to a fmite PD gain, allowing the 
clock edges to sustain some offset wlUi respect to the data 2ero crossings. Shown in Fig. 3(b) is the 
inpul/oulput characteristic of the PD together with the V/T converter, obtained by tmnsisior-levcl 
simulations while the circuit senses a 40-Gb/s random data stream and eight phases of the lO-GHz 
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clock. For a phase error less than ±2.5 p$, the PD displays a relatively constant gain of 1 00 /x A/ps, 
With an ideal V/I converter, a finite phase difference, A<f>y would still lead to injection of a finite 
current into the loop filter (similar to an ideal integrator), forcing the CDR circuit to lock with 
^ero static phase cnon Tlie ouiput resistance of the VA converter, on ihe other hand, results in 
lossy intcgradon, necessitating a small change in as the control voltage varies from mimmum 
to maximum. 

Even though the PD flipflops operate with a 10-GHz clock, proper sampling of 40-Gb/s data 
still requires fast recovery from the previom state and rapid acquisiuon of the present input. To 
this end, both a wide sampling bandwidth and a short clock transition time are necessary. Figure 




Fig. 4. (a) Proposed flipflop, (b) symmetric XOR gate and V/I converter. 

4(a) depicts the master-slave flipflop used in the phase detector. Here, NMOS switches Mi and Mz 
sample on the parasitic capacitances at nodes X and y when CK is high. Since the minimum 
input common-mode (CM) level is dictated by the gate-source voltage of M3-M4 and ^le headroom 
required by Iss, the sampling switches experience only an overdrive voltage of 0.5 V even i^CK 
reaches Vdd* failing to provide fast sampling. This issue is remedied by seuing ihe CM level of 
CK and UK equal to V^d, a choice affoixied by inductively-loaded stages following the VCO 
core. The peak value of CiC thus exceeds Vt>j> by 0.8 V. more than doubling the sampling speed 
of Afi and M2. The large clock swings also minimize the transition times. 
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With large clock swings available, the carrem switching in pairs Afs-Mo, and Mg-jWiu 

is accomplished by gate control rather than conventional source-coupled steering. The proposed 
topology offers two advantages: (1) since the tail current source is removed, Mn-Miz can be quite 
narrower, presenting a smaller capacitance to the VCO buffer; (2) since the drain currents of Mn- 
Mi3 are nor limited by a tall current source, these transistors experience "class AB" switcliing. 
drawing a large current ai the peak of the clock swing and providing greater voltage swings and a 
higher gain in the data path. 

The XOR gates used in the PD muse exhibit symmetry with respect to thehr two inputs and 
operate with a low supply voltage. Shown in Fig. 4(b) along with the V/I converter, the XOR gate 
is a modified version of that in [1 1], with transistors M2 and Ma forming local positive feedback 
loops and avoiding the reference .voltage necessary in the realization in [1 1]. The V/I converter 
copies the output curreni of the XOR, providing nearly rail-to-rail swings for the oscillator control 
line. 

C, Experimental Results 

The CDR ciicuir has been fabricated in a 0.18-Aim CMOS technology. Figure 5 shows a phoio of 
the die, which measures 1 .0 x 1.4 mm'*. The circuit i$ tested on a high-speed probe station with a 
40-Gb/s Amitsu random data generator providing the input. 




Fig. 5. Chip micrograph. 



Shown in Fig. 6 ai« the VCO tuning chafacteristie and free-running spectrum. The VCO 
provides a tuning range of 1.2 GHz with a free-running phase noise of -105 dBc/Hi at I -MHz 
olfset. Figure 7(a) depicts the CDR input and output v\^vefbims under locked condition in response 
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Veirl(V> 

Fig. 6, (a) VCO tuning range, (b) free-mxining spectrum 
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Table I . CDR pcrfoonaace summary. 
to a pseudo-random sequence oflengih 2*^ - 1, The demuliiplexed data experiences some ISI, but 
if fiirdier demultiplcxinfi is included on the same chip, the ISI can be tolerated. Figure 7(b) shows 
the recovered cloclc, suggesting an rms jitter of 1.756 ps and a pcak-lo-peak jitter of 9.67 ps. 
However, as shown in the inset, the oscilloscope itself sufTcis from rms and peak-to-pcak jitters of 
1.508 ps and 8.89 ps, respectively. Thus, thcCDR output contains a jitter of 0.9 ps^s and at most 
9.67 ps,pp. ^ The proformance of this work and some other previously-published CDR circuits is 
sumrriarized in Table 1. * 



' It is unclear wheiher and how chc peak-to-pcak values can be subtracted. 

^The power dissipation noted hen: fof the design in [2J excludes ihcir limiting amplifier and frequency dcicctOT 
contribution and was obtained through private communication with M. Reinbold. 
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Fig. 7. (a) Top to bottom: input data, one demultiplexed output data, and recovered clock (hori- 
zontal scale: 50 ps/div, vertical scale: 100 mV/div), (b) clock jitter measimeinent (horizontal scale: 
S ps/div for both histognuns). 
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Tht undersigned asaigpcc of the eotire inierest in tbo abovt-idcsntified siibjeciappKcadon 
hm^y appoiiiis 

G«»'ee H. Gates . Regisrratian No- 33,500 

Victor G- Cooper Regisiration No. 39,641 

. Karen S. CraBdy Rjcgistrarioa No, 39,927 

WfU5ani J; Wood Registration No, 42^6 

Richard B. Caies Rcgislr^tioii No. 36. 1 00 

Jason S- Fddmar Registration No. 39.1S7 

Bradley K. Lort2: Rji^gitftraiioil No. 45,472 

as its attorney to prosccoitc mis applicadoc aod lo transacT all business in the Utdicd Siaics Pateni 
and Trademark OfEce connected therewith, said sippQinlmeot to be to the cxdusiou of the 
inventors and their anomey(s) in accotdancd with tbk pmvisions of 37 CJ'\R. 3.71. 

An assignment of th& enrim interest in the above-identified subject SQ>plicfiiion: 

[ ] \vas iccozded on at icel/fiaing / 

[X] is submitted herewiib for recoidbig. 

Pleisc difccc all telephone calls to GeoigjcH. Gates ac (310) 641-8797 and all 
corn:3pocidcnoe relative to said application to the following address: 

Gaies & Cooper LLP 
Howard Hughes Center 
6701 Center Drive Wesft, State 10.50 
Los Angclesy Caliibtnia 90045 



tJC Docker Nd. ^3-200-1 
iSCVi^k^ Mo. GddZ 30448.1 lfi.US.7l 
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04-25-2006 01:53PM FROU-Gatss & Cooper LLP 



413106418798 



T-358 P. 029/029 F-660 
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ASSIGNEE: THE RJZJ2r)&NT: 



SigjiatUTc: 
Name; 
Title: 
Address: 




THE UNIVERSITY OF CAIJFQHNIa 



Douglau G. Crawford 
Lxccsnslng Officer 

Office of TcchnolQ^;y TViinKfcr 
1111 FrTOldinSttoci,5*Floor 
Oakaand, CaHfoiTiia 94607 
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